P-values for classification
نویسنده
چکیده
Let (X,Y ) be a random variable consisting of an observed feature vector X ∈ X and an unobserved class label Y ∈ {1, 2, . . . , L} with unknown joint distribution. In addition, let D be a training data set consisting of n completely observed independent copies of (X,Y ). Usual classification procedures provide point predictors (classifiers) Ŷ (X,D) of Y or estimate the conditional distribution of Y given X. In order to quantify the certainty of classifying X we propose to construct for each θ = 1, 2, . . . , L a p-value πθ(X,D) for the null hypothesis that Y = θ, treating Y temporarily as a fixed parameter. In other words, the point predictor Ŷ (X,D) is replaced with a prediction region for Y with a certain confidence. We argue that (i) this approach is advantegeous over traditional approaches and (ii) any reasonable classifier can be modified to yield nonparametric p-values. We discuss issues such as optimality, single use and multiple use validity, as well as computational and graphical aspects.
منابع مشابه
TARGETING CUSTOMERS: A FUZZY CLASSIFICATION APPROACH
Nowadays, marketing serves the purpose of maximizing customer lifetime value (CLV) and customer equity, which is the sum of the lifetime values of the company’s customers. But, CLV calculation encounters some difficulties which limit the usage of this technique. Nonetheless, companies looking for methods to know how to calculate their customers’ CLV. In this paper, fuzzy classification rules we...
متن کاملA Convolutional Neural Network based on Adaptive Pooling for Classification of Noisy Images
Convolutional neural network is one of the effective methods for classifying images that performs learning using convolutional, pooling and fully-connected layers. All kinds of noise disrupt the operation of this network. Noise images reduce classification accuracy and increase convolutional neural network training time. Noise is an unwanted signal that destroys the original signal. Noise chang...
متن کامل25(OH) vitamin D serum values and rheumatoid arthritis disease activity (DAS28ESR)
Background: The role of vitamin D in the pathogenesis of rheumatoid arthritis is under investigation. This study was designed to evaluate the correlation between serum values of 25(OH) vitamin D [25(OH)D] and disease activity in rheumatoid arthritis (RA) patients according to Disease Activity Score 28 joints and ESR (DAS28ESR). Methods: Ninety-nine patients according to ACR classification crit...
متن کاملDetection and Classification of Heart Premature Contractions via α-Level Binary Neyman-Pearson Radius Test: A Comparative Study
The aim of this study is to introduce a new methodology for isolation of ectopic rhythms of ambulatory electrocardiogram (ECG) holter data via appropriate statistical analyses imposing reasonable computational burden. First, the events of the ECG signal are detected and delineated using a robust wavelet-based algorithm. Then, using Binary Neyman-Pearson Radius test, an appropriate classifie...
متن کاملAutomatic road crack detection and classification using image processing techniques, machine learning and integrated models in urban areas: A novel image binarization technique
The quality of the road pavement has always been one of the major concerns for governments around the world. Cracks in the asphalt are one of the most common road tensions that generally threaten the safety of roads and highways. In recent years, automated inspection methods such as image and video processing have been considered due to the high cost and error of manual metho...
متن کاملShortcut Node Classification for Membrane Residue Curve Maps
comNode classification within Membrane Residue Curves (M-RCMs) currently hinges on Lyapunov’s Theorem and therefore the computation of mathematically complex eigenvalues. This paper presents an alternative criterion for the classification of nodes within M-RCMs based on the total membrane flux at node compositions. This paper demonstrates that for a system exhibiting simple permeation behaviour...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008